A Database of On-Line Handwritten Mixed Objects Named "Kondate"
نویسندگان
چکیده
This paper describes a database of on-line handwritten patterns mixed of text, figures, tables, maps, diagrams and so on. Now, pen-based and touch-based interfaces are spreading into people and their surfaces are getting large. People can write and draw mixed objects without paying attention on the difference of objects or the mode change. Moreover, they may write text in any direction in combination with non-text objects on large surfaces. This is clearly one of the largest advantages of pen or touch interfaces but poses a challenging problem of object classification and recognition. The proposed database is made and now being enlarged to study such subjects more extensively. So far, 100 Japanese writers, approximately 25 English and 45 Thai writers have participated. The database stores on-line handwritten (digital ink) patterns with ground-truth tags in InkML. Keyword On-line Handwriting Database; Digital Ink; Text/Non-text; Figure; Table; Map; Diagram; Horizontal Text; Vertical Text
منابع مشابه
Development of a Robust and Compact On-Line Handwritten Japanese Text Recognizer for Hand-Held Devices
The paper describes how a robust and compact on-line handwritten Japanese text recognizer was developed by compressing each component of an integrated text recognition system including a SVM classifier to evaluate segmentation points, an on-line and off-line combined character recognizer, a linguistic context processor, and a geometric context evaluation module to deploy it on hand-held devices...
متن کاملCollection and Analysis of On-line Handwritten Japanese Character Patterns
This paper describes our second collection of on-line handwritten character patterns and their analysis. 163 writers presented about 10,000 character patterns, covering 4,438 categories mainly in the context of sentences. Together with our first collection, the Kuchibue database containing 12,000 patterns from 120 writers, we have now collected about 3 million patterns. For this second collecti...
متن کاملOff-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model
In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...
متن کاملOn-line Handwriting Recognition for Creative Human Interfaces
This paper takes handwriting-based human interfaces as human-centered and creative human interfaces and considers the directions of research and development on on-line handwriting recognition. Then, it summarizes our research on collection of sample pattern database, combination of on-line and off-line recognition methods, a model and implementation of format free handwriting recognition, segme...
متن کاملMinimum-risk training for semi-Markov conditional random fields with application to handwritten Chinese/Japanese text recognition
Semi-Markov conditional random fields (semi-CRFs) are usually trained with maximum a posteriori (MAP) criterion which adopts the 0/1 cost for measuring the loss of misclassification. In this paper, based on our previous work on handwritten Chinese/Japanese text recognition (HCTR) using semi-CRFs, we propose an alternative parameter learning method by minimizing the risk on the training set, whi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014